Cadec: A corpus of adverse drug event annotations

Authors

  • Sarvnaz Karimi
  • Alejandro Metke-Jimenez
  • Madonna Kemp
  • Chen Wang
Abstract

The CSIRO Adverse Drug Event Corpus (Cadec) is a new, richly annotated corpus of medical forum posts on patient-reported Adverse Drug Events (ADEs). The corpus is sourced from posts on social media and contains text that is largely written in colloquial language and often deviates from formal English grammar and punctuation rules. Annotations contain mentions of concepts such as drugs, adverse effects, symptoms, and diseases, linked to their corresponding concepts in controlled vocabularies, i.e., SNOMED Clinical Terms and MedDRA. The quality of the annotations is ensured by annotation guidelines, multi-stage annotation, measurement of inter-annotator agreement, and a final review of the annotations by a clinical terminologist. This corpus is useful for studies in information extraction, or more generally text mining, from social media to detect possible adverse drug reactions from direct patient reports. The corpus is publicly available at https://data.csiro.au.

Similar resources

Recognizing Mentions of Adverse Drug Reaction in Social Media Using Knowledge-Infused Recurrent Models

Recognizing mentions of Adverse Drug Reactions (ADR) in social media is challenging: ADR mentions are context-dependent and include long, varied, and unconventional descriptions as compared to more formal medical symptom terminology. We use the CADEC corpus to train a recurrent neural network (RNN) transducer, integrated with knowledge graph embeddings of DBpedia, and show the resulting model to ...

A semi-supervised learning framework for biomedical event extraction based on hidden topics

OBJECTIVES: Scientists have devoted decades of efforts to understanding the interaction between proteins or RNA production. The information might empower the current knowledge on drug reactions or the development of certain diseases. Nevertheless, due to the lack of explicit structure, literature in life science, one of the most important sources of this information, prevents computer-based syst...

Making adjustments to event annotations for improved biological event extraction

BACKGROUND: Current state-of-the-art approaches to biological event extraction train statistical models in a supervised manner on corpora annotated with event triggers and event-argument relations. Inspecting such corpora, we observe that there is ambiguity in the span of event triggers (e.g., "transcriptional activity" vs. "transcriptional"), leading to inconsistencies across event trigger anno...

Comparison of epidemiological methods for detecting adverse drug reaction signals in Iran

Background and Objectives: To compare three different methods of signal detection applied to the Adverse Drug Reactions registered in the Iranian Pharmacovigilance database from 1998 to 2005. Materials and Methods: All Adverse Drug Reactions (ADRs) reported to the Iranian Pharmacovigilance Center from March 1998 through January 2005 were included in the analysis. The data were analyzed based on thre...

Design and sampling method of a study of the knowledge, attitudes, and practices of households and health workers regarding nutrition and micronutrients in the program's pilot provinces

Journal title:
  • Journal of Biomedical Informatics

Volume 55  Issue 

Pages  -

Publication date 2015